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ABSTRACT 

Much insight into the interactions of DNA and 
enzymes has been obtained using a number of 
single-molecule techniques. However, recent 
results generated using two of these techniques- 
atomic force microscopy (AFM) and magnetic 
tweezers (MT)— have produced apparently contra- 
dictory results when applied to the action of the 
ATP-dependent type III restriction endonucleases 
on DNA. The AFM images show extensive looping 
of the DNA brought about by the existence of 
multiple DNA binding sites on each enzyme and 
enzyme dimerisation. The MT experiments show 
no evidence for looping being a requirement for 
DNA cleavage, but instead support a diffusive 
sliding of the enzyme on the DNA until an enzyme- 
enzyme collision occurs, leading to cleavage. 
Not only do these two methods appear to 
disagree, but also the models derived from them 
have difficulty explaining some ensemble biochem- 
ical results on DNA cleavage. In this 'Survey and 
Summary', we describe several different models 
put forward for the action of type III restriction 
enzymes and their inadequacies. We also attempt 
to reconcile the different models and indicate 
areas for further experimentation to elucidate the 
mechanism of these enzymes. 

INTRODUCTION 

Recent years have seen the introduction and development 
of sophisticated single-molecule techniques that make 



possible the study of molecular interactions that could 
previously only be investigated using bulk measurements. 
Such techniques are increasing in popularity and are 
reputed to provide greater biochemical insight into 
complex molecular systems, for example molecular 
motors (1,2). While these single-molecule techniques 
would at first appear to be preferable to the less specific 
bulk methods, the data produced can lead to incon- 
sistencies. This might be expected if the differences were 
only between findings made with the bulk and single- 
molecule experiments, but there can also be inconsis- 
tencies between data obtained using the different newer 
techniques. Here, we will address the mechanism of 
action of type 111 DNA restriction/modification (R/M) 
enzymes (3-6). Recent study of this enzyme system has 
indeed led to inconsistencies in data and interpretation. 
These enzymes operate as endonucleases on DNA in vivo 
and in vitro and they have been developed as tools for 
superSAGE (7,8) and for counting the number CAG 
repeats in repetitive DNA (9). Recent results using two 
different single-molecule techniques, atomic force micros- 
copy (AFM) (10-12) and magnetic tweezers (MT) (13,14) 
lead to conclusions that are in disagreement. In this article 
we hope to address this disagreement. 

Type III R/M enzymes, in common with many restric- 
tion enzymes (15,16), work best when two copies of their 
target sequence are present on the same DNA molecule. 
Type III R/M enzymes have asymmetric target sites; for 
example EcoP15I recognizes 5'-CAGCAG-3'. In contrast 
to most other restriction enzymes, the type III R/M 
enzymes generally prefer their target sites to be oriented 
in a head-to-head fashion (17,18) and large site separ- 
ations of up to 3.5 kb between the two sites can still lead 
to cleavage. Two enzyme molecules are needed per double 
strand cut; in other words one enzyme per target sequence 
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is the minimum for cleavage. Cleavage occurs at a defined 
location next to only one of the sites, and which one of 
the two sites is chosen for cleavage is random, although 
this is influenced by the base composition of the DNA. 
The EcoP15I R/M enzyme, for example cuts the sequence 
CAGCAG(N) 2 5/27 as long as the underlined adenine is 
unmethylated. The preference for a two-site substrate 
gives some indication of the complexity of the system 
and indicates that some interaction needs to take place 
between the two copies of the enzyme bound to the 
same DNA molecule for cleavage to take place. Further, 
ATP hydrolysis is essential for the overall restriction 
process. 

The type III R/M enzymes comprise two modification 
(Mod) subunits, each containing a target recognition 
domain (TRD) to bind to the target sequence and a 
methyltransferase catalytic (CAT) domain to monitor 
the methylation status of an adenine in the target (the 
second A in the sequence above for EcoP15I), and two 
restriction (Res) subunits each containing a DNA 
helicase and ATP-hydrolysing (ATP) domain and an 
endonuclease DNA cleavage (NUC) domain (6,19,20). 
They share structural features with other complex R/M 
enzymes such as the type I and type IIB R/M enzymes 
(3,6,21). Several models for the mode of action of type 
III R/M enzymes have been proposed. In this survey we 
describe each model and its faults in turn and then 
conclude with a possible model for the operation of 
these enzymes and suggestions for future experiments. 

Experimental models for the operation of type III 
R/M enzymes 

Translocation, loop extrusion and collision model. Initially 
it was believed that ATP hydrolysis by a type III R/M 
enzyme provided energy to allow translocation of DNA 
so that the enzymes bound at each target site would 
eventually collide (18,19). This is found with other DNA 
translocases such as the type I R/M enzymes (3-5). These 
type I R/M enzymes hydrolyse one ATP for roughly each 
base pair of the many thousands translocated during the 
restriction process (22). They do not turn over in the endo- 
nuclease reaction and loops of DNA are extruded by 
translocation as the enzyme apparently remains bound 
to its target site. In this model, two enzymes eventually 
collide with each other, with DNA cleavage resulting at 
the collision point. Since they also hydrolyse ATP, it was 
assumed that the type III R/M enzymes, in common with 
type I R/M enzymes, would also remain bound at their 
target sites during translocation and that loops of DNA 
would be extruded, Figure 1 . Eventually all of the DNA 
between two target sites would be extruded into loops and 
the enzymes would collide with each other and cleave the 
DNA, but in this case at a defined site close to one of the 
initial recognition sequences (18,19). The translocation 
proceeds only from the 3'-side of the asymmetric target 
site and only the NUC domain in the Res subunit on the 
3'-side of the translocating enzyme is active, to agree with 
the observed head-to-head orientation preference. We 
term this the translocation, looping and collision (TLC) 
model, Figure 1 . Curiously, the type III R/M enzymes use 




Figure 1. The TLC model on a head- to-head substrate showing, from 
top to bottom: binding of enzymes (red pentagons with the active NUC 
domain of the Res subunit located at the leading point of the pentagon) 
to target sites (bold arrows), initiation of translocation to extrude 
loops, and, lastly, collision of the enzymes leading to cleavage at one 
or other target site (flash). Small arrows indicate the movement of the 
DNA. 



only ~ 1 % of the amount of ATP consumed by a type I R/ 
M enzyme (13,14,23,24), but they achieve the same 
biological effect, namely the cleavage of DNA containing 
two copies of the target sequence and a restriction of 
propagation of foreign invading DNA through a bacterial 
population. This ATP hydrolysis, driving DNA transloca- 
tion and bringing about collision of two type III R/M 
enzymes has been presumed for many years to be the fun- 
damental requirement for restriction by type III R/M 
enzymes. However, the way in which translocation 
might be achieved with so little ATP hydrolysis is difficult 
to understand and is a major barrier in transposing the 
TLC model from the type I R/M enzymes to the type III 
R/M enzymes. 

DNA end effects and single site substrates pose mechanistic 
problems. Cleavage of various substrates other than a 
two-site head-to-head circular DNA has been observed. 
The TLC model has difficulty in accounting for this 
cleavage. Effective cleavage of circular single-site sub- 
strates (25), of linear single-site substrates (26-28), of 
linear substrates with two sites in a tail-to-tail orientation 
(14,28) and of substrates where the two sites actually 
contact each other (28) have all been observed. Further- 
more, cleavage of linear versions of the head-to-head 
substrates was enhanced to the levels seen with circular 
substrates if the free ends of the DNA were blocked by 
a streptavidin molecule (14,29). 

Considering only the substrates containing two target 
sites for the EcoP15I enzyme, the DNA sequence leading 

to cleavage is actually CAGCAG(N) 320 o to ~+35oob P GT 

CGTC. The limits of the spacing between the two copies 
of the target sequence are those observed experimentally 
and the negative number of non-specific bp is for a 
tail-to-tail orientation on a linear substrate. Sites 
oriented in a head-to-tail manner are considered to be 
poor substrates, but Raghavendra and Rao (27) note 
that even with this substrate some very limited cleavage 
can be seen to occur. The TLC model does not explain any 
of these observations. Thus new models for the operation 
of type III R/M enzymes became necessary. 
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Figure 2. The reversal model on a single-site substrate. Small arrows 
indicate the movement of the enzyme. Top: enzyme (red pentagon with 
the active NUC domain of the Res subunit located at the leading point 
of the pentagon) binds to target site (bold arrow). Second from top: 
site-specific binding triggers ATP-dependent translocation off the site in 
the 3'-direction. Third from top: collision of the enzyme with an end 
causes the enzyme to turn around. A second enzyme is shown bound at 
the target site. Bottom: collision of this reversed enzyme with an 
enzyme at the original target site initiates cleavage (flash). 

The end reversal model. The cleavage of single-site sub- 
strates, particularly of linear substrates led to the obser- 
vation of 'end' effects and the idea that the translocation 
direction could be reversed once an end was reached (27). 
The enzyme would initially bind and recognize its target 
sequence, commence ATP hydrolysis and vacate the target 
site by tracking along the DNA in the 3'-direction, driven 
by the ATP hydrolysis, Figure 2. Only the NUC domain 
on the 3'-side of the target is active. If the translocating 
enzyme collides with another enzyme bound to a second 
target oriented to face the first target, cleavage will result. 
On reaching a free end on a single-site substrate, the 
enzyme would either fall off or, if the end was blocked, 
reverse its direction and track back along the DNA until 
it collides with another enzyme bound at the target site, 
subsequently producing DNA cleavage. Reversal of the 
translocation direction at the end of a linear DNA 
molecule includes a reversal in orientation of the active 
NUC domain to allow DNA cleavage. If both enzymes 
have translocated away from their target sites, the colli- 
sion will be ineffective and the process will have to be 
repeated. We will refer to this as the 'reversal' model, 
Figure 2. 

At first sight, the reversal model can explain head- 
to-head, tail-to-tail and single-site linear cleavage effect- 
ively. Unfortunately, it cannot explain the cleavage of 
single-site circles, as no end exists to cause reversal. 
Instead, the translocating enzyme would simply run into 
the back of the second enzyme bound at the target site 
and the active NUC domains would not be juxtaposed 
to cleave. Furthermore, the model allows the enzyme to 
reverse when it hits an end and this reversal includes the 
requirement for the active NUC domain to reverse its 
orientation of DNA. Unfortunately, this would lead to 
effective cleavage of head-to-tail substrates, which would 
now be as good a substrate as head-to-head DNA. Since 
cleavage of head-to-tail substrates is very much poorer 
than cleavage of head-to-head substrates, the model 
provides only a partial explanation of the experimental 
observations. 

MT experiments: the random walk sliding model. A similar 
model has been proposed on the basis of MT assays, 
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Figure 3. The MT model on a head-to-head substrate. Small arrows 
indicate the movement of the enzyme. Top: enzymes (red pentagons 
with the active NUC domain of the Res subunit located at the 
leading point of the pentagon) are bound at each target site (bold 
arrows). Second and third from top: the right-hand enzyme diffuses 
off its target site after ATP activation and moves randomly backwards 
and forwards along the DNA. Note that the orientation of the moving 
enzyme on the DNA is maintained. Low levels of ATP hydrolysis keep 
the enzyme on the DNA. The left hand enzyme remains at its target. 
Bottom: the enzyme eventually collides with the left-hand enzyme 
initiating cleavage (flash). If the ends of the DNA are blocked with 
an obstacle (such as streptavidin), the moving enzyme generally does 
not fall off the DNA and cleavage probability is enhanced. 



tethered particle motion (TPM) assays and biochemical 
data (13,14,29,30). Cutting was observed on both 
head-to-head and tail-to-tail linear DNA, particularly 
when the ends were blocked by streptavidin. This cutting 
occurred even when the DNA was under sufficient tension 
to preclude any looping, even if coupled to ATP hydroly- 
sis. Due to the low time resolution of the measurements, 
at low tension, transient looping could not be ruled out. 
The low ATP consumption was proposed to originate 
from an ATP-induced activating 'conformational 
change' when the enzyme was bound at a target site. 
This activation then allowed the enzyme to track along 
the DNA in a one-dimensional random walk without 
further energy consumption, Figure 3. The walk would 
go in either direction and would leave the original target 
site vacant for further enzymes to bind. The random 
walking enzyme could then collide with an enzyme 
bound at a second site and, if the orientation were 
correct, cleavage would occur. Head-to-head sites would 
allow cleavage to occur as the collision would be 'head-on' 
while tail-to-tail sites would require the first enzyme to 
walk past the second site prior to binding of the second 
enzyme. A subsequent collision would then be head-on as 
required for cleavage. Cleavage would occur as long as the 
second enzyme to bind had not commenced its random 
walk and was still at a target site as also proposed in the 
reversal model. We will refer to this model as the MT 
model, Figure 3. 

Although the model ignores structural information on 
the type III R/M enzymes, it implicitly assumes that only 
the NUC domain on the 3'-side of the target is active. The 
diffusion brings the active NUC domain into contact with 
another enzyme still bound at a second target site and 
cleavage will occur if the orientation correctly brings 
the active NUC domains in the two enzymes into 
contact. If this second enzyme has moved off the target 
site, then the whole process is ineffective and has to be 
repeated, as in the reversal model. The MT model explains 
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cleavage of head-to-head and tail-to-tail substrates very 
effectively, even if the sites are adjacent, as it does not 
allow the enzyme to change the orientation of the active 
NUC domain on the DNA when an end is hit. On the 
other hand, because of this requirement, the model is 
unable to explain the good cleavage of single-site sub- 
strates, as two active NUC domains would never come 
into contact: one enzyme would run into the back of the 
other and the active NUC domains would not contact 
each other. This is true for both linear and circular 
single-site DNA molecules, as one would always obtain 
only head-to-tail collisions. Hence this model is also 
incomplete. 

The transient looping and translocation model from 
AFM. AFM showed that in the presence of ATP, 
EcoP15I could cause DNA to form loops held together 
at their apex by one or two copies of the enzyme, depend- 
ing upon whether one or two copies of the target sequence 
were present (10-12). On a linear molecule with a single 
target site, a single copy of the enzyme bound to the target 
site could capture another segment of the DNA molecule, 
resulting in one or two DNA loops. The lengths of the 
loops were in excellent agreement with well characterized 
theories of DNA flexibility and diffusive looping, rather 
than having sizes dictated by DNA translocation driven 
by ATP hydrolysis (12). The apex of the loop usually 
included the target sequence, but occasionally the 
specific target sequence was not at the apex, implying 
that the enzyme could move off the target sequence after 
the formation of the loop. It might also be proposed to 
bind two non-specific sites and form a loop, but since no 
loops were observed on substrates completely lacking 
target sites, this possibility can be discounted. On 
two-site substrates, multiple loops were observed 
emanating from a single apex. The apex apparently had 
two copies of the enzyme bound, as also observed for type 
I R/M enzymes (31), and the loops were again of the size 
expected from DNA flexibility. On neither substrate were 
additional single enzyme molecules observed, contrary to 
what one would expect from the MT model. Fast-scan 
AFM further showed the real-time formation and 
breaking of loops and the introduction of supercoiling 
due to translocation past the enzyme, as in the TLC 
model, once DNA was stably trapped in a loop (11). 
The observation of supercoiling correlates with the bulk 
observation that supercoiling influences cleavage efficiency 
(19). A model was proposed showing how the initial for- 
mation of loops via diffusion and segment capture, which 
would shorten the distance between two target sites, could 
subsequently reduce the amount of translocation required 
for inter-site collision of the NUC domains and the 
amount of ATP required (12), Figure 4. Since two-site 
substrates showed an enzyme dimer at the apex of 
multiple loops, the model also incorporated a specific di- 
merisation of the R/M enzyme and a probable enzyme 
structure which would account not only for the number 
of loops but also detect the orientation of the two target 
sites and whether they were on the same DNA molecule. 
We will refer to this as the AFM model, Figure 4. 




Figure 4. The AFM model on a head-to-head substrate. Small arrows 
indicate the movement of the DNA. Top: enzymes (red pentagons with 
the active NUC domain of the Res subunit located at the leading point 
of the pentagon) are bound at each target site (bold arrows). Middle: 
ATP activates both enzyme:enzyme dimerisation (left hand side) and 
the trapping of loops via the flexibility of the DNA (right hand side). 
Loop A results from the dimerisation of the enzymes while the other 
loops result from trapping of DNA segments between the Mod subunit 
and the Res subunit. By this mechanism the inter-site distance is 
reduced prior to further translocation. Bottom: translocation, driven 
by ATP hydrolysis, leading to contraction of loop A in the 
head-to-head complex, pulls the loop taut between the two 
translocating Res subunits. It is postulated that this is the trigger for 
cleavage (flash) by the two NUC domains. 



For this model it was suggested that the translocation 
movement would be directional and driven by ATP hy- 
drolysis just as in the TLC model (12). However, we note 
that a one-dimensional random walk diffusion, activated 
by ATP hydrolysis as in the MT model, could be an 
alternative way of changing the size of the loops (30), 
although this would not account for the observed super- 
coiling (11,19). 

The AFM model works for two-site substrates in ex- 
plaining the cleavage of head-to-head and tail-to-tail sub- 
strates rather than head-to-tail substrates. It also allows 
discrimination of sites in trans as the protein-protein 
interaction is responsible for detecting the site orientation. 
The observation of looping on single-site substrates, which 
can be cleaved, can also be explained if the enzyme is 
allowed to become activated and undergo inter-segment 
transfer on the DNA, thereby clearing the target site to 
bind a second enzyme. However, in common with the 
other models there are weaknesses. In this case dimers 
were not observed by AFM on single-site DNA so this 
model cannot explain cleavage of such substrates. The 
AFM model also does not work for situations where the 
two target sites are adjoining, as there is insufficient space 
on the DNA. Hence, once again the model cannot be 
complete. 

Perhaps supporting the AFM model as a plausible 
mechanism is the fact that looping to bring two target 
sequences in cis together is a feature of a substantial 
number of restriction enzymes (15,16) and has often 
been observed using AFM [e.g. (31,32)] and electron mi- 
croscopy [e.g. (33)]. The loops often require enzyme 
'dimers'. Single-molecule methods such as MT and TPM 
assays have also led to the observation of looping (32, 
34-36). In some cases this looping is very obvious, but 
sometimes the forces generated on DNA by the technique 
reduce looping to very low levels so that it is hard to 
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observe without special care, especially if the stability of 
the loop is low. For example the loops formed by Ecll8kl 
type II restriction enzyme were obvious via AFM, but 
formed in only a few percent of events in a TPM assay 
(32). Biochemical evidence is also derived from measure- 
ments of the enzyme molecular weight in the presence and 
absence of DNA (enzyme 'dimers' may be stable in the 
absence of DNA or require DNA binding to be stable), 
and from a dependence of cleavage of a long piece of 
DNA on the presence of a second short duplex, itself con- 
taining a target site [e.g. (37)]. Such an enhancement of 
cleavage has been observed for the type III R/M enzyme 
EcoPI, although in this case the cleavage occurred on a 
long piece of DNA lacking any target sites and was ap- 
parently an example of a 'secondary' cleavage activity of 
the enzyme (29). 

A potential reconciliation of the models and conclusions 

It should now be clear that none of the models, TLC, 
Reversal, MT or AFM, fit all current data. So is there 
some way to incorporate all the data in a model of oper- 
ation encompassing the biochemical data but also 
resolving the discrepancies revealed by AFM, a predom- 
inantly 'structural' method and MT, a predominantly 
'kinetic' method, bearing in mind that DNA cleavage 
under high applied force indicates that a requirement for 
looping can be circumvented? 

The MT model has been put forward most forcefully by 
Szczelkun et al. (30), but the model disregards the loops 
observed by AFM and other biochemical evidence for 
looping, such as the obvious cleavage of single-site 
circular DNA (25,29) and the observation that cleavage 
is hindered by supercoiling of the DNA (19). The absence 
of apparent looping in the TPM assay when applied to the 
type III R/M enzymes is also cited as support for the MT 
model (13,14), but it is apparent that the data are not of 
sufficient resolution, when compared with the data 
obtained for the Ecll8kl type II restriction enzyme, to 
rule out looping (32). It is important to note that DNA 
looping and one-dimensional diffusion along a DNA 
contour are both fast events occurring on the millisecond 
timescale, much faster than the time resolution of the 
assays currently employed, be they single molecule or 
ensemble assays (1,38,39). 

The AFM model proposes that the observed loops are 
formed via the natural flexibility of DNA and that ATP 
hydrolysis then drives their expansion or contraction until 
they cannot change further, at which point cleavage is 
induced. If instead one proposes that after their initial 
formation, the loops expand or contract via randomly 
diffusing backwards and forwards through the enzyme 
in a process maintained via ATP hydrolysis then one 
could link the AFM and MT models together to explain 
the data on two-site substrates. 

A key question is why do the MT experiments not 
reveal loops? It is worth highlighting that in all of the 
proposed models, cleavage is preceded by a specific 'di- 
merisation' of two enzyme molecules in such a way as to 
position two active NUC domains together. So the 
question is really when does this dimerisation occur? For 



two-site substrates, we suggest that the AFM surface 
enhances the ability of an EcoPI 51 bound to its target 
site to stick to another EcoPI 51 bound at a second site, 
noting that such collisions are inevitable and rapid on any 
long piece of DNA in three and two dimensions, as 
long as the DNA is not under tension (35,36,39). The 
inter-segmental collision is indeed one of the fastest 
ways to communicate between sites on DNA, especially 
when other DNA-binding proteins are bound along the 
DNA contour (38,40). If the kinetics of this collision inter- 
action, including its sticking to the AFM surface, are 
faster than the kinetics of activating the sliding mechan- 
ism, which allows the enzyme to leave its site, then the 
AFM will predominantly see these looped complexes, 
whereas the MT experiments will predominantly see the 
sliding complexes. Thus the MT and AFM data may be 
seeing the same pathway but with different aspects being 
highlighted. 

Thus, reconciliation between the AFM and MT models 
is possible for the cleavage of substrates with two target 
sites but there is clearly also a need for further experimen- 
tation on the cleavage of single-site substrates, as this is 
still unexplained, and on the structural domains and inter- 
faces present in the enzyme. Further assays could include 
an examination of whether a type III enzyme bound to its 
target would interfere with cleavage by a type II restriction 
enzyme that had an overlapping target site. The MT 
model would suggest that little hindrance would occur 
as the type III enzyme would diffuse away to leave the 
site for the type II restriction enzyme clear of obstruction. 
The AFM 'dimerisation via looping' model could be 
tested by varying the spacing between two target sites, 
as it would predict a 10 bp periodicity for the ability 
of the two enzymes to dimerize. Future experiments on 
type III restriction enzymes might also include a direct 
visualisation of fluorescently-labelled enzymes diffusing 
on hydrodynamically stretched DNA (1,2) and more 
sophisticated dynamic looping experiments using higher 
time resolutions (here we note the potential of new 
CMOS-based single photon imagers (41) with microsec- 
ond resolution). Perhaps the most useful in vitro experi- 
ments to perform now are an analysis of the multiple 
DNA binding sites on each copy of the enzyme and 
further study of the three-dimensional structure, particu- 
larly the 'dimerisation' and subunit interfaces, of these 
complex enzymes to constrain speculations on their 
mode of operation. 

In addition, any more complete model needs to give 
further consideration to the target situation in vivo and 
specifically whether the foreign DNA is naked, i.e. 
devoid of other non-sequence-specific proteins (40,42), 
stretched out or coiled up. All of these factors will influ- 
ence the ability of the restriction enzymes, and not just 
type III enzymes, to find and bind their target sequences 
and ultimately to protect the cell from invasion. 
Furthermore, the model of action has to be fast enough 
to prevent replication of the invading foreign DNA 
because the R/M system will generally only have 
~lmin, if one considers the life cycle of, for example a 
lytic phage, in which to act on foreign DNA. 
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